Sequential neural text compression
نویسندگان
چکیده
The purpose of this paper is to show that neural networks may be promising tools for data compression without loss of information. We combine predictive neural nets and statistical coding techniques to compress text files. We apply our methods to certain short newspaper articles and obtain compression ratios exceeding those of the widely used Lempel-Ziv algorithms (which build the basis of the UNIX functions "compress" and "gzip"). The main disadvantage of our methods is that they are about three orders of magnitude slower than standard methods.
منابع مشابه
Text Compression Via Alphabet Re-Representation
This article introduces the concept of alphabet re-representation in the context of text compression. We consider re-representing the alphabet so that a representation of a character reflects its properties as a predictor of future text. This enables us to use an estimator from a restricted class to map contexts to predictions of upcoming characters. We describe an algorithm that uses this idea...
متن کاملSyntactically Informed Text Compression with Recurrent Neural Networks
We present a self-contained system for constructing natural language models for use in text compression. Our system improves upon previous neural network based models by utilizing recent advances in syntactic parsing – Google’s SyntaxNet – to augment character-level recurrent neural networks. RNNs have proven exceptional in modeling sequence data such as text, as their architecture allows for m...
متن کاملSequential Recurrent Neural Networks for Language Modeling
Feedforward Neural Network (FNN)-based language models estimate the probability of the next word based on the history of the last N words, whereas Recurrent Neural Networks (RNN) perform the same task based only on the last word and some context information that cycles in the network. This paper presents a novel approach, which bridges the gap between these two categories of networks. In partic...
متن کاملCompressing Texts with Neural Nets
Neural networks are promising tools for data compression without loss of information. We combine predictive neural nets and statistical coding techniques to compress text les. We apply our methods to certain short newspaper articles and obtain compression ratios exceeding those of widely used Lempel-Ziv algorithms (which are the basis of the UNIX functions \compress" and \gzip"). Our methods' m...
متن کاملAn Experimental Assessment of Compression Ratio and Evaluation of Aluminium Cylinder Head in Bi-Fuel(Gasoline+CNG) Engines
The first step in dealing with the behavior of an engine on NG mode is the installation of a suitable gas fueling system on the engine. Two generations of gas fueling systems which were studied and installed on research engine were Mixer type and Sequential system type. The results, including performance, emissions and fuel consumption, were recorded and analyzed. The results showed that the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE transactions on neural networks
دوره 7 1 شماره
صفحات -
تاریخ انتشار 1996